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Abstract. We use Prolog as a flexible meta-language to provide ex- 
ecutable specifications of some fundamental mathematical objects and 
their transformations. In the process, isomorphisms are unraveled be- 
tween natural numbers and combinatorial objects (rooted ordered trees 
representing hereditarily finite sequences and rooted ordered binary trees 
representing Godel's System T types). 

This paper focuses on an application that can be seen as an unex- 
pected "paradigm shift": we provide recursive definitions showing that 
the resulting representations are directly usable to perform symbolically 
arbitrary-length integer computations. 

Besides the theoretically interesting fact of "breaking the arithmetic/sym- 
bolic barrier" , the arithmetic operations performed with symbolic objects 
like trees or types turn out to be genuinely efficient - we derive implemen- 
tations with asymptotic performance comparable to ordinary bitstring 
implementations of arbitrary-length integer arithmetic. 
The source code of the paper, organized as a literate Prolog program, is 
available at |http : //logic . cse .unt . edu/tcLrau/resecLrch/2 011/pPAR .pll 
Keywords: modeling finite mathematics in logic programming, symbolic 
arbitrary precision arithmetic, ranking /unranktng of hereditarily finite 
sequences, balanced parenthesis languages. 



1 Introduction 

This paper exhibits a creative use of logic programming as a modeling tool 
for several interesting concepts at the intersection of combinatorics, formal lan- 
guages, foundation of mathematics and coding theory. It builds on the declar- 
ative data transformation framework introduced in [112] . where we introduce 
a methodology to derive bijective mappings between fundamental data types 
used in programming languages (sets, multisets, sequences to graphs, digraphs, 
DAGs, hypergraphs etc.) 

At the same time, with practical uses for arbitrary size integer arithmetic in 
mind, we will focus on keeping the asymptotic complexity of various operations 
similar to that of operations on conventional bitstrings. 

Like [T] , this paper is organized as a literate Prolog program. This means that 
our "lingua franca" is logic programming rather than the usual mathematical 
notation. 



It has been a long tradition in logic programming to model program prop- 
erties and behaviors in terms of mathematical reasoning. We pay it back this 
time, and model some intriguing mathematical concepts as logic programs. 

The paper is organized as follows. 

Section [2] overviews, following ^ a bijection between natural numbers and 
sequences that is extended in section [3j by recursive application, to hereditarily 
finite sequences. Section |4] describes a novel way to perform arbitrary length 
arithmetic computations using multiway tree representations of hereditarily fi- 
nite sequences and discusses some potential applications for implementation of 
arithmetic operations with numbers that do not fit in computer memory with 
conventional binary encodings. It is followed by a sketch of similar mechanism in 
section [5] for the type language of Godel's system T. Section [6] introduces a bi- 
jection between hereditarily finite sequences and balanced parenthesis languages 
providing a succinct representation for them. Sections [7] and |8] discuss related 
work and conclude the paper. 

2 A bijection between finite sequences and natural 
numbers 

Let N be the set of natural numbers and [N] the set of finite sequences of natural 
numbers (that can also be seen as the set of functions from an initial segment 
of N to N - or even more generally, as finite functions). We will first derive, 
following [T] a bijection N -J> [N]. 

We define the following predicates working on natural numbers: 

cons(X,Y,XY) :-X>=0,Y^,XY is (1+(Y«1))«K. 
hd(XY,X) :-XY>0,P is XY /\ 1 ,hdl (P,XY,X) . 
hdl(l,_,0) . 

hdl(0,XY,X) :-Z is XY»1 ,hd(Z ,H) ,X is Sfl . 
tl(XY,Y) :-hd(XY,X) ,Y is XY»(X-|-1) . 
null(O) . 

After observing that the relations cons (X,Y,Z) , hd(Z,X), tl(Z,Y) hold if and 
only if Z = 2^ (2Y -1-1), it can be proven by structural induction that: 

Proposition 1 The predicates cons/3, hd/2, tl/2, null/1 emulate the list 
functions CONS,CAR,CDR,NIL as defined in 0/ (see proof in f^). 

Using these predicates we define a bijection between finite sequences repre- 
sented as lists of their values and natural numbers 

list2nat( [] ,0) . 

list2nat([XlXs] ,N) :-list2nat(Xs,Nl) , cons (X,N1 ,N) . 



nat21ist(0, [] ) . 

nat21ist(N, [X|Xs]) :-N>0,hd(N,X) ,tl(N,T) ,nat21ist(T,Xs) . 



working as follows: 

?- nat21ist(2012,Ns) ,list2nat (Ns ,N) . 
Ns = [2, 0, 0, 1, 0, 0, 0, 0] , 
N = 2012 



3 Ranking Hereditarily Finite Sequences 

Definition 1 The ranking problem for a family of combinatorial objects is finding a 
unique natural number associated to each object, called its rank. The inverse unranking 
problem consists of generating a unique combinatorial object associated to each natural 
number. 

Definition 2 A hereditarily finite sequence is [] or a finite sequence of hereditarily 
finite sequences. 

We will describe, by instantiating the data type transformation described in [1] 
how to extend a bijection N — >■ [N] to trees representing hereditarily finite sequences. 
The two sides of the bijection are expressed as two higher order predicates rank and 
unrank parameterized by two transformations F and G: 

unrank(F,N,Rs) : -call(F,N,Ns) ,maplist (unrank (F) ,Ns,Rs) . 
rank(G,Ts,Rs) : -maplist (rank(G) ,Ts,Xs) ,call(G,Xs,Rs) . 



These predicates can be seen as a form of "structured recursion" that propagate a 
simpler operation (F and G) guided by the structure of the underlying data type. We 
can instantiate this mechanism to derive a bijection between natural numbers and trees 
representing hereditarily finite sequences using rank and unrank as: 

nat2hfseq(N,T) : -unrank (nat21ist ,N,T) . 
hf seq2nat(T,N) : -rank(list2nat ,T,N) . 



They work as follows: 

?- nat2hfseq(2012,HFSEQ) ,hf seq2nat (HFSEQ ,N) . 
HFSEq = [[[[]]], [], [], [[]], [], [], [], []], 
N = 2012 

One can represent the recursive unfolding of a natural number into a hereditarily finite 
sequence as a directed ordered multigraph (Fig.[T]). Note that as the mapping nat21ist 
generates a sequence where the order of the edges matters, this order is indicated with 
integers starting from labeling the edges. 



Fig. 1: 2012 as a HFSEQ 



4 Computing with herediteirily finite sequences 

This section describes a surprising possibility derived from tlie existence of bijections 
between various data types and natural numbers. It answers positively the following 
question: can we turn such bijections into actual isomorphisms such that operations 
like additions or multiplications defined on symbolic objects (e.g. trees or parenthesis 
languages) mimic their natural number equivalents? Moreover, we want a genuinely 
constructive proof that this can be done, which means that we need to build inductive 
definitions, starting with successor and predecessor and then extend them to implement 
everything else. 

We will build these operations incrementally. We start with successor/predecessor 
operations and simple (but slow) mappings to natural numbers. We then provide ef- 
ficient implementations, working, like in the case of bitstring representations, in time 
proportional to the size of the operands. 



4.1 Successor and predecessor 

To derive efficient successor and predecessor operations we recall that the equation 
Z=[X|Y] on hereditarily finite sequences corresponds bijectively to the equation 

Z = 2^{2Y + l) (1) 

on natural numbers. Successor and predecessor predicates s/2 and p/2 are defined as: 



s([], [[]]). 

s([[K|Ks] |Xs] , [[] ,Kl|Xs]):-p([K|Ks] ,K1) . 

s ( [ [] I Xs] , [ [Kl I Ks] I Ys] ) : -s (Xs , [K | Ys] ) , s (K , [Kl | Ks] ) . 

?([[]],[]). 

p([[] ,K|Xs] , [[Kl|Ks] |Xs]):-s(K, [KllKs]) . 

p([[K|Ks] |Xs] , [[] |Zs]):-p([K|Ks] ,K1) ,p( [Kl | Xs] ,Zs). 



The two predicates are deterministic and implement functions when their first argu- 
ments are ground, given that the patterns used in the heads of the rules share no 



instances. If executed under a breadth-first evaluation rule (or if impure Prolog opera- 
tions are used) the two predicates can be merged into a single reversible predicate. We 
have preferred pure Horn clause definitions, however, and reordered the goals in the 
clause bodies as needed. 

When navigating over hereditarily finite sequence trees, s/2 implements tree trans- 
formations such that the following propositions hold: 

Proposition 2 If T is such that hf seq2nat(T,N) , s(T,Tl) and hf seq2nat (Tl ,N1) 
hold, then Nl is N+1. 

Proposition 3 //T (assumed different from []) is such f/iaf hf seq2nat(T,N), p(T,Tl) 
and hfseq2nat(Tl,Nl) hold, then Nl is N-1. 

One can rephrase this saying that the pair hfseq2nat and nat2hfseq acts as a iso- 
functor that transports successor and predecessor operations between natural numbers 
and hereditarily finite sequences. A proof is obtained by structural induction on the 
first argument of the two predicates after defining a mapping between a multiway tree 
type and a natural number type supporting an axiomatization of Peano arithmetic. 
Note that by replacing [] by and each relation of the form [X I Y] =Z in the induc- 
tive definition of s and t with equations of the form 2^ * (2 * Y + 1) ~ Z one can 
obtain arithmetic formulas that, after simplifications result in the the usual arithmetic 
relations defining s and p. 

One can prove the correctness of s and p with respect to the corresponding successor 
and predecessor operations on N, by verifying that when interpreting each constructor 
in terms of equation [l] on N the resulting formulas become identities. 

For instance, s ([],[[]] ) becomes s(0, 2° * (2 * -I- 1)) and then s(0, 1) which states 
that the successor of is 1. 

On the other hand the second and third recursive equations in the definitions of 
s and p become logical implications between arithmetic identities, relatively easy to 
prove through a sequence of simplifications. 

For instance, the second equation in the definition of s/2 becomes, after putting 
[iC|/Cs] -> x,Xs y, Kl ^ z with x,y,z £ N. 

s{[x\y\,[Q,z\y]):~p{x,z). (2) 

After interpreting : - as inverse logical implication we obtain 

5(2"= * (2 * y + 1), 2" * 2 * (2^ * (2 * y + 1)) + 1) ^ p{x, z). (3) 

After interpreting s and p as successor and predecessor on N we obtain: 

1 + (2"= * (2 * y + 1) = 2 * 2^ * (2 * y + 1) 1 ^ (a; = z -Hi). (4) 

After replacing a; by 2 + 1 on the left side we obtain: 

2"^+^ * (2 *y-f- 1) = 2^+^ * (2 *y + 1) (5) 

which is clearly an identity in N. 

Note that the ability to reason about the correctness of our programs has been 
clearly facilitated by the declarative semantics of Prolog, for instance when interpreting 
:- as reverse logical implication. 

Let us now define, using s/2 and p/2 a simple (but inefficient) bijection from trees 
(with leaves made of empty lists) to ordinary natural numbers: 



tree2nat([] ,0) . 




tree2nat ( [X 1 Xs] ,N) 


-p([X|Xs] ,Y) ,tree2nat(Y,M) ,N is 1 . 


nat2tree(0, []) . 




nat2tree(N, [X|Xs]) 


-N>0,M is N-l,nat2tree(M,Y),s(Y,[X|Xs]). 



working as follows: 



?- nat2tree(2012,T) ,tree2nat(T,N) . 

T = [[[[]]]. []. [], [[]], []. [], [], []]. 

N = 2012 ; 

After defining a generator for the infinite stream of hereditarily finite sequences mapped 

to successive natural numbers 

n([]). 

n(S) :-n(P) ,s(P,S) . 



one can confirm empirically that our two symbolic s/2 and p/2 operations provide 
indeed emulations of their standard counterparts: 

?- n(X) ,tree2nat(X,N) . 
X = [] , N = ; 
X = [[]] , N = 1 ; 
X = [[[]]], N = 2 ; 
X = [[], []], N = 3 ; 



4.2 Simple arithmetic operations in terms of successor and 
predecessor 

The s/2 and p/2 predicate pair can be used to implement the usual arithmetic oper- 
ations in time (N) where N is the natural number corresponding to the first operand. 
For instance, addition can be defined as follows: 

slow_add([] ,X,X) . 

slow.add ( [X I Xs] , Y , Z) : -p ( [X I Xs] , P) , s (Y , Yl ) , slow.add (P , Yl , Z) . 



It works indeed as expected: 

?- nat2tree(42,T) ,slow_add(T,T,R) ,tree2nat (R,N) . 

T = [[[]], [[]]. [[]]], R = [[[[]]], [[]], [[]]], N = 84 

We will next define efficient operations, with asymptotic complexity comparable to 
typical bignum packages provided by various languages. 

4.3 Basic recognizers and constructors 



We start with recognizers for odd numbers o_/2, strictly positive even numbers i_/2 
and zero e_/l. 



o_([[] !_]). 

i_([[_|_] |_]). 
e_([]) . 

Next, we define our constructors. The first one, o/2 builds odd numbers, as if provided 
by leftshift+increment operation 2*X+1 The later applies the successor predicate to the 
result of the first, as if provided by the 2*X+2 operation. 

o(X,[[] |X]). 
i(X,Y):-s([[] |X] .Y). 

Note that the predicate e_/l can also be seen as a constructor for the empty list 
representing 0. 

4.4 Arithmetic operations with hereditarily finite sequences — 
efficiently 

To provide cflicicnt, possibly practical implementations of arithmetic operations, we 
will need a few more steps towards emulating binary representations including variants 
of left and right shifting operations. 

Deconstructing Let us first build a deconstructor r/2, working as a decrement + 
rightshift operation on bitstrings such that it maps both 2*X+1 and 2*X+2 to X, i.e. 
such that it reverses the action of the constructors o/2 and i/2. 

r([[] |Xs],Xs). 

r([[X|Xs] |Ys] ,Rs) :-p([[X|Xs] | Ys] , [[] | Rs] ) . 

Note that the first clause maps to n a term corresponding to an odd number of the form 
2*n+l, while the second applies the predecessor to an even number while trimming the 
result (an odd number) in a similar way to the first clause. 

Converting back and forth Given the deconstructor r/2 and the constructors o/2 

and i/2, wc can empirically validate the intuitions behind our symbolic representations, 
by mapping them one-to-one to conventional natural numbers. 

We first define a converter s2n/2, mapping tree representations of hereditarily finite 
sequences to conventional natural numbers: 

s2n([],0). 

s2n(X,R):-o_(X),r(X,S),s2n(S,N),R is 1+2*N. 
s2n(X,R):-i_(X),r(X,S),s2n(S,N),R is 2H-2*N. 

tlicii a converter n2s/2 from natural imuil)ers to our .syiiil)olic reiireseutatious: 
n2s(0, []) . 

n2s(N,R) :-N>0,P is N mod 2,N1 is (N-1) // 2, 
n2s(Nl,X) , 
( P=:=0^i(X,R) 
; o(X,R) 
). 



They work as expected, and s2n can be seen as enumerating the stream of natural 
numbers correctly. 



?- n2s(42,S) ,s2n(S,N) . 
S = [[[]], [[]], [[]]], 
N = 42 

?-n(X) ,s2ii(X,N) . 
X = [] , N = ; 
X = [[]], N = 1 ; 
X = [[[]]], N = 2 ; 
X = [[], []], N = 3 ; 



Note also that they work in time proportional to the size of the representations. 

Efficient Addition Guided by this mapping, that sees our symbolic representations 

as if they were bitstrings in bvjective base-2, wc can implement an addition operation 
working in time proportional to the size of the operands: 

a([],Y,Y). 

a([X|Xs].[].[X|Xs]). 

a(X,Y,Z) :-o_(X) ,o_(Y) ,al(X,Y,R) , i(R,Z) . 

a(X,Y,Z) :-o_(X) ,i_(Y) ,al(X,Y,R) , a2(R,Z) . 

a(X,Y,Z) :-i_(X) ,o_(Y) ,al(X,Y,R) , a2(R,Z) . 

aCX.Y.Z) :-i_(X) .i_(Y) ,al(X,Y.R) . s(R.S) , i (S,Z) . 

al(X,Y,R) :-r(X,RX) ,r(Y,RY) ,a(RX,RY,R) . 
a2(R,Z) :-s(R,S) ,o(S,Z) . 



working instantly on arbitrarily large natural numbers: 

?-n2s ( 1234567890 1234567890 , A) , n2s (10000000000000000000 ,B),a(A,B,S),s2n(S.N). 

A= [[[]], [[[]]], [[]], [], [[]], [[]], [[[...]]], [], []|...], 

B= [[[], [], [[[]]]], [[]], [], [], [], [[[]]], [[], []], [[...]]...!...], 

S= [[[]], [[[]]], [[]], [], [[]], [[]], [[[...]]], [], []|...], 

N = 22345678901234567890 . 

Efficient Multiplication We can implement efficient multiplication guided by in- 
tuitions about binary multiplication in base 2 and bijective-base 2 as follows: 

m([], _,[]). 
m(_, [],[]). 

m(X,Y,Z) :-p(X,Xl) ,p(Y,Yl) ,mO(Xl,Yl,Zl) ,s(Zl,Z) . 
mO([] ,Y,Y) . 

mO([[] |X] ,Y, [[] |Z]):- mO(X,Y,Z) . 

mO(X,Y, Z) :-i_(X) ,r(X.Xl) ,mO(Xl,Y,Zl) ,a(Y, [[] |Z1] ,Y1) ,s(Yl,Z) . 



One can see that it handles easily large numbers (the googol= 10^°° included!): 



?-n2s(12345678901234567890,A) ,n2s( 10000000000000000000, B) ,m(A,B,S) ,s2n(S,N) . 

A= [[[]], [[[]]], [[]], [], [[]], [[]], [[[...]]], [], []|...], 

B= [[[], [], [[[]]]], [[]], [], [], [], [[[]]], [[], []], [[...]]|...], 

S= [[[[[]]], [[]]], [[]], [[], [[]]], [], [[[[]]]], [[]], □!...], 

N = 123456789012345678900000000000000000000 . 

?- n2s((10'100) .Googol) ,m(Googol,Googol,S) ,s2n(S,N) . 

Googol = [[[[[]]], [[[]]], []], [[], []], [], [], [], [[], []], [[]] I...], 
S= [[[[], []], [[[]]], []], [[[[]]]], [], [], [[]], [[[]]], [] I...], 
N = 100000000 00000000000000000000000000000000 

Let < T, a, m > denote the algebraic structure induced by the operations a and m on 
the set of multiway trees representing hereditarily finite sequences and < N, +, * > the 
corresponding algebraic structure on natural numbers with addition and multiplication. 
The following holds: 

Proposition 4 The addition and multiplication operations a/3 and m/3 induce an 
isomorphism between the semirings with commutative multiplication < N, +, * > and 
< T, a, m >. 

We conclude this first part of the paper by confessing that inventing (the asymp- 
totically efficient) Horn clause definitions of various arithmetic operations would not 
have been possible without the "reverse engineering" capabilities provided by the data 
transformation framework in [I], which has enabled us to move at will between rep- 
resentations like bijective base-2 binary numbers, bit-stacks, hereditarily finite sets, 
hereditarily finite sequences and watch the internal workings of ordinary operations 
through functors defined between these domains. 

While page limits do not allow us to describe this process in full detail, we have ex- 
tended these operations to cover, with asymptotic complexity comparable to standard 
bignum packages, to comparaisons, subtraction, division, powers etc. 

5 Computing with binary trees representing Godel's 
System T types 

Definition 3 In Godel's System T ^ a type is either N or t ^ s where t and s are 
types. 

The basic type A*' usually stands for the type of natural numbers. We will briefly 
show here that natural numbers can be emulated directly with types, by using a single 
constant e as basic type, representing 0. 

First, we observe that, guided by the known isomorphism between multiway and 
binary tree^ we can bring with a functor defined from hereditarily finite sequences 
to binary trees the definitions of s/2 and p/2 into corresponding definitions in the 
language of system T types, s_/2 and p_/2. 



s_ (e , (e— >e) ) . 




s_(((K^Ks)^Xs) , (e->-(Kl->Xs))) 


:- p_((K^Ks) , Kl) . 


s_((e->Xs), ((Kl-i>Ks)-5>Ys)) :-s_ 


(Xs, (K->Ys)), s_(K, (Kl^Ks)). 



^ That manifests itself in languages like Prolog or LISP as the dual view of lists as a 
representation of sequences or binary CONS-cell trees. 



P- 


((e- 


>e) , e) . 










P- 


((e- 


>(K^Xs)) , 


((Kl^Ks)- 


^Xs)) :- s_(K, 


(Kl- 


^Ks)) . 


P- 


(((K- 


-^Ks)^Xs) 


(e^Zs)) 


:- p_((K^Ks), 


Kl), 


p_((Kl^Xs) , Zs) . 



The following example illustrates that s_ and p_ work as expected: 



?- s_(e,One) ,s_(One,Two) , s_ (Two, Three) ,s_(Three, Four) ,p_ (Four, Three) . 

One = (e->e) , 
Two = ((e->e)->e). 
Three = (e->e->e) , 
Four = (((e->e)->e)->e) 

We will only give here the code of a generator n_/l for the infinite stream of 
natural numbers represented as types in system T, and a simple converter to usual 
natural numbers t2n, modeled after tree2nat/2. 

n_(e). 

n_(S) :-n_(P),s_(P,S) . 
t2n(e,0) . 

t2n((T->S) ,N) :-p_((T^S) ,U) ,t2n(U,M) ,N is M+-1 . 



confirming empirically that our computations mimic the usual ones: 

?- n_(T),t2n(T,N) . 

T = e, N = ; 

T = (e->e) , N = 1 ; 

T = ((e->e)->e) , N = 2 ; 

T = (e->e->e) , N = 3 ; 

T = (((e->e)->e)->e) , N = 4 ; 

Fast arithmetic computations, operating directly on types, can be derived using the 
corresponding code for hereditarily finite sequences as "boilerplate" . 

Deriving a bidirectional successor/predecessor predicate The predicates s_ and p_ 
are mutually recursive and structurally similar. Moreover, each of them would run re- 
versibly under a breadth-first evaluation order. An interesting challenge is to derive 
a bidirectional variant replacing both predicates. One could achieve this by using im- 
pure operations like nonvair/l to check which argument is instantiated or, equivalently, 
checking the instantiation of the arguments using negation as failure. We proceed by 
merging the two predicates' shared clauses and adding an extra argument taking the 
values up or down to indicate which way the the computation goes. 

sp(e, (e— >-e) , _) . 

sp(((K^Ks)^Xs) , (e^(KH-Xs)) ,Dir) :- 

f lip (Dir, Other) , 

sp(Kl, (K^-Ks) , Other). 
spCCe^Xs), ((Kl^Ks)->-Ys), up) :- 

sp(Xs, (K— >Ys) ,up), 

sp(K, (KH-Ks), up). 



sp( (e— >Xs) , 


((Kl- 


->Ks)— >Ys), down) :- 


sp(K, (Kl- 




down) , 


sp(Xs, (K- 


-4Ys) 


, down) . 



flip (up, down) . 
f lip(down,up) . 

up_or_down(_X,Y,down) :- \+(Y=other) . 
up_or_down(X,_Y,up) :- \+(X=other) . 

sp (X , Y) : -up_or_down (X , Y , Dir ) , sp (X , Y , Dir ) . 



Note also the auxiliary predicate flip/2, which indicates a change of direction, and the 

auxiliary predicate up_or_down, that choses among the two possible directions, based 
on the instantiation of at least one of the arguments of sp/2. We detect instantiation 
of the arguments testing them against the atom other, assumed not to be part of the 
Herbrand Universe of our program. 

One step further, we push the call to sp/3 into flip/2 (as it is the only continuation 
of flip/2), and merge the last two clauses, while delegating the ordering of the recursive 
calls to the auxiliary predicate order _sp. Note that we also fold up_or_down as part of 
the definition of sp/2. 



sp(e, (e— >-e), _) . 

sp(((K-J-Ks)->-Xs) , (e^-(KH-Xs)) , Dir) :-f lip_sp(Dir, Kl, (K->-Ks)). 
sp((e^-Xs), ((KH-Ks)^-Ys) , Dir) :-order_sp(Dir, Xs, (K^-Ys) , K, (KH-Ks)). 

flip_sp(up,X,Y) :- sp(X,Y,down) . 
f lip_sp(down,X,Y) :- sp(X,Y,up) . 

order_sp(up,A,B,C,D) :- sp(A,B,up), sp(C,D,up) . 
order_sp(down,A,B,C,D) :- sp(C,D,down) , sp(A,B,down) . 

sp(X,Y) :- \+(X=other), sp(X,Y,up) . 
sp(X,Y) :- \+(Y=other), sp(X,Y,down) . 



One can try out sp/2 working as a bidirectional successor/predecessor predicate when 
at least one of its arguments is instantiated: 

?- spCPred, ((e->e)->e)) . 
Pred = (e->e) . 

?- sp((e->e) ,Succ) . 
Succ = ((e->e)->e) . 

?- sp((e->e),((e -> e) -> e)). 
true. 



6 Mapping hereditarily finite sequences to parenthesis 
languages 



We will next explore the bijection between hereditarily finite sequences and the lan- 
guage of balanced parenthesis, known to combinatorialists [5I6I7| as a member of the 
Catalan family, which also includes the binary trees representing System T types. 

An encoder for the balanced parenthesis language is obtained by combining a parser 
and a writer, which, with some ingenuity, can be made one and the same in a language 
like Prolog. 

As hereditarily finite sequences naturally map one-to-one to parenthesis expressions 
expressed as bitstrings, we will choose them as target of the transformers. Our parser 
recurses over a bitstring (encoding balanced parentheses ' [' as 0, '] ' as 1) and builds 
a HFSEq tree T: 

pars_hf seq(Xs ,T) : -pars2term(0 , 1 ,T,Xs , [] ) . 
pars2term(L,R,Xs) — > [L] ,pars2args(L,R,Xs) . 
pars2args(_,R, [] ) — > [R] . 

pars2args(L,R, [X| Xs] ) — >-pars2term(L,R,X) ,pars2args(L,R,Xs) . 



Note that parsjif seq is bidirectional i.e. it works both as an encoder and a decoder: 

?- pars_hfseq([0,0,l,0,l,l] ,T) ,pars_hf seq(Ps,T) . 

T = [[] , []] , 

Ps = [0, 0, 1, 0, 1, 1] 

One can see the bijection defined by parsJif seq as a bridge between a family of formal 
languages and hereditarily finite sequences, represented as multiway trees. 

Kraft's inequality As the sequences computed by parsJifseq are elements of the 
balanced parenthesis language (also called Dyck primes) [S|, they implement uniquely 
decodable self- delimiting codes. Moreover, each of them is also a prefix code, i.e. there's 
no way to add a string made of any combination of balanced left or right parenthesis 
at the end of a code and obtain another code. For a similar reason, each of them is also 
a suffix code. Such codes are known in the literature under a variety of different names 
i.e. as reversible variable-length codes, bifix codes or fix-free code^ 

In particular, given that they are uniquely decodable codes, it follows that the Kraft 
inequality [9^ holds for them, i.e. if Zo, • • • ^fc • • • denote the length of the codes, then 

^2-'''<l (6) 

fc>0 

We define the function computing the left side of the Kraft inequality (called Kraft- 
sum), and the corresponding test as follows. 

^ A nice property of such codes is that parallel bidirectional decoding is possible. Also, 
the ability to decode from either the beginning or the end makes them suitable for 
encoding media streams. 



kraft_suiii(M,S) :- Ml is M-1, numlist (0 ,M1 ,Ns) , 
maplist(kraft_term,Ns,Ls) , 
sumlist (Ls ,S) . 

kraft_term(N,X) :-parsize(N,L) , X is 1/2"L. 

parsize(N,L) :- nat2hfseq(N,HFSEQ) , pars_hf seq(Xs ,HFSEQ) , length(Xs,L). 
kraf t_inequality (M) : -kraf t_suiii(M, S) ,S=Cl. 



The following example illustrates that the Kraft's inequality holds and it is likely that 
the Kraft-sum converges to a value below 0.5: 

?- maplist(kraft_suin, [10,100,1000,2000,3000,4000] ,R) . 

R = [0.364258, 0.382935, 0.390383, 0.391615, 0.392292, 0.392598] 

The bijection between hereditarily finite sequences and balanced parenthesis lan- 
guages provides a succinct alternative representation for purposes of efficient arithmetic 
operations using bitvector operations - by encoding the two parenthesis as and 1. As 
a possible practical application, this allows building in Prolog, at source level, a library 
supporting arbitrary length arithmetic operations. 



7 Related work 

Ranking functions can be traced back to Godel numberings |1UI11) associated to formu- 
lae. Together with their inverse unranking functions they are also used in combinatorial 
generation algorithms [12113] . Natural number encodings of hereditarily finite sets have 
triggered the interest of researchers in fields ranging from Axiomatic Set Theory and 
Foundations of Logic to Complexity Theory and Combinatorics |14I15I16] . 

The encodings of hereditarily finite sets and sequences described in this paper 
originate in [111711811^ . The key difference is that while in our previous work we use 
pairs of bijections encapsulated as higher order predicates/functions to define various 
isomorphisms directly, here we provide actual algorithms for arithmetic operations, 
ordering etc. while in our previous work the existence of such algorithms was only 
implied "non-constructively" . 

An emulation of Peano and conventional binary arithmetic operations in Prolog, is 
described in [2^ . Their approach is similar as far as a symbolic representation is used. 
The key difference with this paper is that our operations work on tree structures, and as 
such, they are not based on previously known algorithms. Our tree-based algorithms are 
also likely to support parallel execution in a way similar to the powerlists of [21]. Arith- 
metic computations with types expressed as C++ templates are described in [22] and 
in online articles by Oleg Kiselyov using Haskell's type inference mechanism. However, 
the mechanism advocated there is basically the same as j20J, focusing on Peano and 
binary arithmetics. The connection between hereditarily finite sequences and balanced 
parenthesis languages places them the context of the well known to combinatorialists 
Catalan families [5l6l7j . 



8 Conclusion 



We have derived a few algorithms expressing arithmetic computations symbohcally, in 
terms of hereditarily finite sequences and types in Godel's system T. 

This has been made possible by extending the techniques introduced in [T] that 
allow observing the internal working of intricate mathematical concepts through iso- 
morphisms transporting operations between fundamental data types. 

At the same time, we have shown that logic programming provides a flexible frame- 
work for modeling mathematical concepts from fields as diverse as combinatorics, for- 
mal languages, type theory and coding theory. 

Arithmetic operations with hereditarily finite sequences are likely to be interesting 
for hardware (FPGA) implementations of large integer operations used in cryptography. 
They are also subject to parallelization '2T' and can provide computations with giant 
numbers that do not fit in any computer memory with a flat bitstring representatioij^ 

Reversible variable length (bifix) codes like the ones we derived in section [6] have 
found uses in image and video coding [23) (including MPEG4!). Prefix codes are used 
in defining modern versions of Kolmogorov complexity |24|- The fact that this prop- 
erty holds, recursively, for arbitrary parts of the code, combined with their ability to 
express programming language constructs, as shown in [I], makes them an interesting 
alternative to the Elias codes [25j typically used in the field. 
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